Inducing Clause-Combining Rules: A Case Study with the SPaRKy Restaurant Corpus
Abstract
We describe an algorithm for inducing clause-combining rules for use in a traditional natural language generation architecture. An experiment pairing lexicalized text plans from the SPaRKy Restaurant Corpus with logical forms obtained by parsing the corresponding sentences demonstrates that the approach is able to learn clause-combining operations which have essentially the same coverage as those used in the SPaRKy Restaurant Corpus. This paper fills a gap in the literature, showing that it is possible to learn microplanning rules for both aggregation and discourse connective insertion, an important step towards ameliorating the knowledge acquisition bottleneck for NLG systems that produce texts with rich discourse structures using traditional architectures.
Similar resources
Inducing clause-combining operations for natural language generation
Recent work in end-to-end generation has reduced the need for knowledge engineering, but is insufficiently sensitive to discourse structure. We present a method for inducing clause-combining rules for use in a traditional natural language generation architecture to address this gap. Our algorithm is able to learn all of the clause-combining rules present in the SPaRKy restaurant corpus from exem...
The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts
Using the Methodius Natural Language Generation (NLG) System, we have created a corpus which consists of a collection of generated texts which describe ancient Greek artefacts. Each text is linked to two representations created as part of the NLG process. The first is a content plan, which uses rhetorical relations to describe the high-level discourse structure of the text, and the second is a ...
Enhancing the Expression of Contrast in the SPaRKy Restaurant Corpus
We show that Nakatsu & White’s (2010) proposed enhancements to the SPaRKy Restaurant Corpus (SRC; Walker et al., 2007) for better expressing contrast do indeed make it possible to generate better texts, including ones that make effective and varied use of contrastive connectives and discourse adverbials. After first presenting a validation experiment for naturalness ratings of SRC texts gathere...
Evaluating Automatic Extraction of Rules for Sentence Plan Construction
The freely available SPaRKy sentence planner uses hand-written weighted rules for sentence plan construction, and a user- or domain-specific second-stage ranker for sentence plan selection. However, coming up with sentence plan construction rules for a new domain can be difficult. In this paper, we automatically extract sentence plan construction rules from the RST-DT corpus. In our rules, we use...
Learning Contrastive Connectives in Sentence Realization Ranking
We look at the average frequency of contrastive connectives in the SPaRKy Restaurant Corpus with respect to realization ratings by human judges. We implement a discriminative n-gram ranker to model these ratings and analyze the resulting n-gram weights to determine if our ranker learns this distribution. Surprisingly, our ranker learns to avoid contrastive connectives. We look at possible expla...